Mapping Scrambled Korean Sentences into English Using Synchronous TAGs

نویسنده

  • Hyun S. Park
چکیده

plg /9 50 50 29 13 M ay 1 99 5 Mapping Scrambled Korean Sentences into English Using Synchronous TAGs Hyun S. Park Computer Laboratory University of Cambridge Cambridge, CB2 3QG, U.K. [email protected] Abstract Synchronous Tree Adjoining Grammars can be used for Machine Translation. However, translating a free order language such as Korean to English is complicated. I present a mechanism to translate scrambled Korean sentences into English by combining the concepts of Multi-Component TAGs (MC-TAGs) and Synchronous TAGs (STAGs). 1 Motivation Tree Adjoining Grammars (TAGs) were rst developed by Joshi, Levy, and Takahashi (Joshi et al., 1975). There are other variants of TAGs such as STAGs (Shieber and Schabes, 1990), and MC-TAGs (Weir, 1988). STAGs in particular can be used for machine translation and were applied to KoreanEnglish machine translation in a military message domain (Palmer et al., 1995). Park (Park, 1995) suggested a way of handling Korean scrambling using MC-TAGs together with a priority concept. However, as scrambled argument structures in Korean were represented as sets using MC-TAGs, a mechanism to combine MC-TAGs and STAGs was necessary to translate Korean scrambled sentences into English. 2 Korean-English Machine Translation Using STAGs STAGs are a variant of TAGs introduced to characterize correspondences between tree adjoining languages. They can be used to relate TAGs for two different languages for machine translation (Abeill e et al., 1990). The translation process consists of three steps. The source sentence is parsed according to the source grammar. Each elementary tree in the derivation is considered with the features given from the derivation through uni cation. Second, the source derivation tree is transferred to a target derivation. This step maps each elementary tree in the source derivation tree to a tree in the target derivation tree by looking in the transfer lexicon. And nally, the target sentence is generated from the target derivation tree obtained in the previous step. The transfer lexicon consists of pairs of trees, one from the source language and the other from the target language. Within the pair of trees, nodes may be linked. Whenever adjunction or substitution is performed on a linked node in a source tree, the corresponding operation applies to the linked node in the target tree. 264 NP SP NP N Tom i

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mapping Scrambled Korean Senteces into English Using Synchronous TAGs

Synchronous Tree Adjoining Grammars can be used for Machine Translation. However, translating a free order language such as Korean to English is complicated. I present a mechanism to translate scrambled Korean sentences into English by combining the concepts of Multi-Component TAGs (MC-TAGs) and Synchronous TAGs (STAGs). 1 M o t i v a t i o n Tree Adjoining Grammars (TAGs) were first developed ...

متن کامل

Korean Grammar Using TAGs

This paper addresses various issues related to representing the Korean language using Tree Ad joining Grammars Topics covered include Korean grammar using TAGs Machine Translation between Korean and English using Synchronous Tree Adjoining Grammars STAGs handling scrambling using Multi Component TAGs MC TAGs and recovering empty arguments The data for the parsing is from US military telecommuni...

متن کامل

Representing Paraphrases Using Synchronous TAGs

This paper looks at representing paraphrases using the formalism of Synchronous TAGs; it looks particularly at comparisons with machine translation and the modifications it is necessary to make to Synchronous TAGs for paraphrasing. A more detailed version is in Dras (1997a). 1 I n t r o d u c t i o n The context of the paraphrasing in this work is that of Reluctant Paraphrase (Dras, 1997b). In ...

متن کامل

Korean to English Translation Using Synchronous TAGs

It is often argued that accurate machine translation requires reference to contextual knowledge for the correct treatment of linguistic phenomena such as dropped arguments and accurate lexical selection. One of the historical arguments in favor of the interlingua approach has been that, since it revolves around a deep semantic representation, it is better able to handle the types of linguistic ...

متن کامل

The Improvement of Negative Sentences Translation in English-to-Korean Machine Translation

This paper describes the algorithm for translating English negative sentences into Korean in English-Korean Machine Translation (EKMT). The proposed algorithm is based on the comparative study of English and Korean negative sentences. The earlier translation software cannot translate English negative sentences into accurate Korean equivalents. We established a new algorithm for the negative sen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995